A quantitative model of trading and price formation in financial markets 
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We use standard physics techniques to model trading and price formation in a market under the 
assumption that order arrival and cancellations are Poisson random processes. This model makes 
testable predictions for the most basic properties of a market, such as the diffusion rate of prices, 
which is the standard measure of financial risk, and the spread and price impact functions, which 
are the main determinants of transaction cost. Guided by dimensional analysis, simulation, and 
mean field theory, we find scaling relations in terms of order flow rates. We show that even under 
completely random order flow the need to store supply and demand to facilitate trading induces 
anomalous diffusion and temporal structure in prices. 
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There have recently been efforts to apply physics meth- 
ods to problems in economics |^. This effort has 
yielded interesting empirical analyses and conceptual 
models. However, with the exception of refinements to 
option pricing theory, so far it has had little success 
in producing theories that make falsifiable predictions 
about markets. In this paper we develop a mechanistic 
model of a standard method for trade matching in order 
to make quantitative predictions about the most basic 
properties of markets. This model differs from standard 
models in economics in that we make no assumptions 
about agent rationality. The model makes falsifiable pre- 
dictions based on parameters that can all be measured 
in real data, and preliminary results indicate that it has 
substantial explanatory power 

The random walk model was originally introduced by 
Bachelier to describe prices, five years before Einstein 
used it to model Brownian motion Q]. Although this is 
one of the most widely used models of prices in finan- 
cial economics, there is still no understanding of its most 
basic property, namely, its diffusion rate. We present a 
theory for how the diffusion rate of prices depends on 
the flow of orders into the market, deriving scaling rela- 
tions based on dimensional analysis, mean field theory, 
and simulation. We also make predictions of other ba- 
sic market properties, such as the gap between the best 
prices for buying and selling, the density of stored de- 
mand vs. price, and the impact of trading on prices. 

Most modern financial markets operate continuously. 
The mismatch between buyers and sellers that typically 
exists at any given instant is solved via an order-based 
market with two basic kinds of orders. Impatient traders 
submit market orders, which are requests to buy or sell a 
given number of shares immediately at the best available 
price. More patient traders submit limit orders, which 
also state a limit price, corresponding to the worst allow- 
able price for the transaction. Limit orders often fail to 
result in an immediate transaction, and are stored in a 
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queue called the limit order hook. Buy limit orders are 
called bids, and sell limit orders are called offers or asks. 
We will label the logarithm of the best (lowest) offering 
price a{t) and the best (highest) bid price b{t). There is 
typically a non-zero price gap between them, called the 
spread s[t) = a(t) — b{t). 

As market orders arrive they are matched against limit 
orders of the opposite sign in order of price and arrival 
time. Because orders are placed for varying numbers of 
shares, matching is not necessarily one-to-one. For ex- 
ample, suppose the best offer is for 200 shares at $60 and 
the the next best is for 300 shares at $60.25; a buy mar- 
ket order for 250 shares buys 200 shares at $60 and 50 
shares at $60.25, moving the best offer a{t) from $60 to 
$60.25. A high density per price of limit orders results 
in high liquidity for market orders, i.e., it implies a small 
price movement when a market order of a given size is 
placed. 

We analyze the queueing properties of such order- 
matching algorithms with the simple random order place- 
ment model shown in Fig. |^. All the order flows are mod- 
eled as Poisson processes. We assume that market orders 
in chunks of a shares arrive at a rate of /i shares per unit 
time, with an equal probability for buy orders and sell 
orders. Similarly, limit orders in chunks of a shares ar- 
rive at a rate of a shares per unit price and per unit time. 
Offers are placed with uniform probability at integer mul- 
tiples of a tick size dp in the range b{t) < p < oo, and 
similarly for bids on — oo < p < a{t). p represents the 
logarithm of the price, and dp is a logarithmic price inter- 
val Ip^ . (To avoid repetition the word price will hence- 
forth refer to the logarithm of price.) When a market 
order arrives it causes a transaction. Under the assump- 
tion of constant order size, a buy market order removes 
an offer at price a(t), and a sell market order removes a 
bid at price b{t). Alternatively, limit orders can be re- 
moved spontaneously by being canceled or by expiring. 
We model this by letting any order be removed randomly 
with constant probability i5 per unit time. 

This order placement process is designed to permit an 
analytic solution. The model builds on previous work 
modeling the continuous double auction [|[ ||, 0, |[ ||, 
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FIG. 1: Schematic of the order-placement process. Stored 
hmit orders are shown "stacked" along the price axis, with 
bids (buy limit orders) negative and offers (sell limit orders) 
positive. New limit orders are visualized as "falling" randomly 
onto the price axis. New offers can be placed at any price 
greater than the best bid, and new bids can be placed at any 
price less than the best offer. Limit orders can be removed 
by random spontaneous deletion or by market orders of the 
opposite sign. 



p^ , pl| . While the assumption of hmit order placement 
over an infinite interval is clearly unrealistic it 
provides a tractable boundary condition for modeling the 
behavior of the limit order book in the region of interest, 
near the midpoint price m(t) = (a(0 + b{t))/2. It is 
also justified because limit orders placed far from the 
midpoint usually expire or are canceled before they are 
executed. For our analytic model we use a constant order 
size a. In simulations we also use variable order size, 
e.g. half-normal distributions with standard deviation 
•y/2/7r(T, which gives similar results. 

For simplicity in our model we do not directly allow 
limit orders that cross the best price. For example, a 
buy order of size x + y may have a limit price that is 
higher than the best ask, so that x shares immediately 
result in a trade, and y shares remain on the book. Such 
an order is indistinguishable from a market order for x 
shares immediately followed by a non-crossing limit order 
for y shares. By definition our model implicitly allows 
such events, though it neglects the resulting correlation 
in order placement. 

Dimensional analysis simplifies this problem and pro- 
vides rough estimates of its scaling properties. The 
three fundamental dimensions are shares, price, and time. 
There are five parameters: three order flow rates and two 
discreteness parameters. The order flow rates are fj,, the 
market order arrival rate, with dimensions of shares per 
time; a, the limit order arrival rate per unit price, with 
dimensions of shares per price per time; and S, the rate of 
limit order decays, with dimensions of 1/time. The two 
discreteness parameters are the price tick size dp, with 
dimensions of price, and the order size a, with dimen- 
sions of shares. Because there are five parameters and 
three dimensions, and because the dimensionality of the 
parameters is sufficiently rich, all the properties of the 



limit-order book can be described by functions of two 
independent parameters. 

We perform the dimensional reduction by taking ad- 
vantage of the fact that the effect of the order flow rates 
is primary to that of the discreteness parameters. This 
leads us to construct nondimensional units based on the 
order flow parameters alone, and take nondimensional- 
ized versions of the discreteness parameters as the inde- 
pendent parameters whose effects remain to be under- 
stood. There are three order fiow rates and three fun- 
damental dimensions. Temporarily ignoring the discrete- 
ness parameters, there are unique combinations of the or- 
der flow rates with units of shares, price, and time. These 
define a characteristic number of shares Nc — iJ,/26, a 
characteristic price interval pc — ^/2a, and a charac- 
teristic timescale tc = 1/S. (The factors of two are a 
matter of convenience; they occur because we have de- 
flned the market order rate for either a buy or a sell order 
to be /i/2.) These characteristic values can be used to 
define nondimensional coordinates p — p/pc for price, 
N = N /Nc for shares, and t = t/tc for time. 

A nondimensional scale parameter based on order size 
is constructed by dividing the typical order size a (which 
is measured in shares) by the characteristic number of 
shares Nc, i.e. e = cr/Nc = 25a /(i. e characterizes the 
granularity of the orders stored in the limit order book. 
A nondimensional scale parameter based on tick size is 
constructed by dividing by the characteristic price, i.e. 
dp/Pc = 2adp/ fjL. The theoretical analysis and the simu- 
lations show that there is a sensible continuum limit as 
the tick size dp — > 0, in the sense that there is non-zero 
price diffusion and a finite spread. Furthermore, the de- 
pendence on tick size is usually weak, and for many pur- 
poses the limit dp — > approximates the case of finite 
tick size fairly well. 

Space constraints do not permit us to review the theo- 
retical development of the model in this Letter; it is pre- 
sented in detail in Ref. |13 . We write an approximate 



master equation for the number of shares at each price 
level p at time t, and then find a self-consistent mean 
field theory steady-state solution. We also develop an 
independent interval approximation, borrowing methods 
from the study of reaction-diffusion equations |lj]. We 
find that the theory fits the simulation results accurately 
for large values of e. For small values of e the theory con- 
tinues to capture the mean spread very well. The pre- 
dictions of other properties remain qualitatively correct, 
but are no longer quantitatively accurate. The results 
we quote here are all from simulations; to see the devel- 
opment of the theory and comparisons to simulation, see 
ref. m. 

In the following we explore the predictions of the model 
for the basic properties of markets. As already noted, ne- 
glecting the effects of the discreteness parameters, gives 
three dimensional quantities and three parameters, which 
we call the continuum approximation. For the continuum 
approximation dimensional analysis alone yields simple 
estimates for the most relevant economic properties of 
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FIG. 2: The mean depth profile versus price in nondimen- 
sional coordinates n = npc/Nc = nS/a vs. p — p/pc ~ 2ap/fi. 
The origin p/pc = corresponds to the midpoint. We show 
three different values of the nondimensional granularity pa- 
rameter: e = 0.2 (solid), e = 0.02 (dash), e = 0.002 (dot), all 
with tick size dp = 0. 
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FIG. 3: The average price impact corresponding to the re- 
sults in Fig. (^. The average instantaneous movement of the 
nondimensional mid-price, {dm)/pc caused by an order of size 
N/Nc = Ne/a. e = 0.2 (solid), e = 0.02 (dash), e = 0.002 
(dot). 



the models. We have refined these estimates by simula- 
tion and mean field theory, which take the effects of e 
and dp/pc into account. The results are summarized in 
Table B, and described in more detail below. 

The bid-ask spread is the difference between the best 
price for buying and selling. It is an important deter- 
minant of transaction costs, as it is the price that one 
pays to buy a share and then immediately sell it. The 
spread has dimensions of price and therefore scales under 
the continuum approximation as fx/a. Simulations and 
theory show that the spread varies as {fj,/a)f{e,dp/pc), 
where / is a fairly flat function with f(e,dp/pc) « 1/2 
across much of the range of interest (see ref . ) . 

Another interesting quantity is the average depth pro- 
file n(j>) = {n{p,t)), which is the density of shares per 
price interval. The average depth profile is relatively 
small near the midpoint and increases to an aysymp- 
totic value far from the midpoint, as shown in Fig. |^. 
The approach to an asymptotic value is a consequence 
of our assumption of uniform order placement over an 
infinite range, and is not realistic. It should be viewed as 
a convenient boundary condition for understanding the 
depth near p — 0, where transactions occur. From dimen- 
sional analysis the asymptotic depth, which has units of 
shares/price is a/S. This result is exact. 

An important property of the depth profile is its slope 
near the origin, which determines the price response to 
the placement of a small market order. From continuum 
dimensional analysis the slope of the average depth pro- 
file scales as A ~ c? j [ib. This is altered by effects due to 
the granularity of orders. For large e the depth profile is 
a concave function with nonzero values at p = 0, whereas 
for small e, n(0) « and n{p) is convex near p = 0. 

In addition to the spread, the price response for ex- 
ecuting a market order is also a key factor determining 



transaction costs. It can be characterized by a price im- 
pact function Ap = 0(w, r, t), where Ap is the price shift 
at time t~\-T caused by a market order w at time t. Price 
impact causes market friction, since selling tends to drive 
the price up and buying tends to drive it down, so ex- 
ecuting a circuit causes a loss. (This loss, from chang- 
ing the order distribution, is in addition to the intrinsic 
loss created by a nonzero spread.) The price impact is 
closely related to the demand function, providing a nat- 
ural starting point for theories of statistical or dynamical 
properties of markets ^ . A naive argument predicts 
that the price impact (^(yi) should increase at least lin- 
early: Fractional price changes should not depend on the 
scale of price. Suppose buying a single share raises the 
price by a factor A: > 1. If fc is constant, buying ixi shares 
in succession should raise it by fc". Thus, if buying uj 
shares all at once affects the price at least as much as 
buying them one at a time, the ratio of prices before and 
after impact should increase at least exponentially. Tak- 
ing logarithms implies that the price impact as we have 
defined it above should increase at least linearly. In con- 
trast, from empirical studies ^(ti-') for buy orders grows 
more slowly than linearly ^ |||. A recent 

more accurate study by Lillo et al. demonstrates that for 
the New York Stock Exchange (/) is strongly concave, and 
that in general a simple power law of the form (/'(cj) ^ uj^ 
is not a good approximation . 

Our model gives average instantaneous price impact 
fimctions Ap — {4>{uj, 0, t)) as shown in Fig. ^ The price 
impact approaches a linear function for large e, but for 
smaller values of e it is strongly concave, particularly 
near the midpoint. Plotting this on log-log scale, this 
function does not follow a pure power law. For example, 
for e — 0.002, the exponent is /9 « 0.5 for small orders, 
and /? « 0.2 for larger orders. This is in agreement with 
the results of Lillo et al. The instantaneous price 
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Quantity Dimensions Continuum scaling relation Scaling from simulation and theory 

Asymptotic depth shares /price d = a/5 

Spread price s^^i/a s — {^./a)f{e,dp/pc) 

Slope of depth profile shares /price^ A ^ — d/s A = (a'^ / fj,5)g{e, dp/pc) 

Price diffusion rate price^ /time D ~ fi^S/a (r — » 0, dp 0) Do ~ fi^S /ae'"'^ 

(r ^ oo, dp ^ 0) Doo ~ n'^S/ae^-^ 

TABLE I: Predictions of scaling of market properties vs. order flow. The third column contains predictions from the continuum 
analysis, in which the discreteness parameters are ignored, and the fourth column gives more accurate predictions from theory 
and simulation. The functions / and g are the order of magnitude of one throughout the relevant ranges of variation of e and 
dp/Pc- 
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FIG. 4: The variance of the change in the nondimensional- 
ized midpoint price versus the nondimensional time lag. For 
a pure random walk this would be a straight line whose slope 
is the diffusion rate, which is proportional to the square of 
the volatility. The fact that the slope is steeper for short 
times comes from the nontrivial temporalpersistence of or- 
ders. The three cases correspond to Fig. Ci e = 0.2 (solid), 
e = 0.02 (dash), e = 0.002 (dot). 



impact 0, t) can be understood in terms of the depth 
profile n(p, t), as explained in ref. |p^ . 

The price diffusion rate is a property of central inter- 
est. In finance, it is typically characterized in terms of 
the standard deviation of prices at a particular timescale, 
which is referred to as volatility. Volatility is a measure of 
the uncertainty of price movements and is the standard 
way to characterize risk. In Fig. ^ we plot simulation re- 
sults for the variance of the change in the midpoint price 
at timescale r, i.e. the variance of ■m{t + t) — m{t). The 
slope is the diffusion rate, which at any fixed timescale 
is proportional to the square of the volatility. It appears 
that there are at least two timescales involved, with a 
faster diffusion rate for short timescales and a slower dif- 
fusion rate for long timescales. Such correlated diffusion 
is not predicted by mean- field analysis. Simulation re- 



sults show that the diffusion rate is correctly described by 
the product of the estimate from continuum dimensional 
analysis [i^i/o? , and a r-dependent power of the nondi- 
mensional granularity parameter e = 25a/ as summa- 
rized in table ||. We cannot currently explain why this 
power is —1/2 for short term diffusion and 1/2 for long- 
term diffusion. 

This model contains numerous simplifying assump- 
tions. Nevertheless, it is the very simplicity of this model 
that allows us to make unambiguous predictions about 
the most basic properties of real markets. Our prediction 
for the price impact function agrees with the best empiri- 
cal measurements and suggests that concavity is a robust 
feature deriving from institutional structure, rather than 
rationality [ pl| . Preliminary analysis of data from the 
London Stock Exchange using the nondimensional coor- 
dinates defined here shows an even better collapse of the 
price impact function, suggesting the existence of uni- 
versal supply and demand functions. Futhermore, the 
results indicate that scaling relationships along the lines 
that we predict have remarkable explanatory value for 
both the spread and the price diffusion rate ||^. Even 
though we do not expect the predictions of this model 
to be exact in every detail, they provide a simple bench- 
mark that can guide future improvements. Our model il- 
lustrates how the need to store supply and demand gives 
rise to interesting temporal properties of prices and liq- 
uidity even under assumptions of perfectly random order 
flow, and demonstrates the importance of making realis- 
tic models of market mechanisms. 
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